Analysis, clustering and prediction of the conformation of short and medium size loops connecting regular secondary structures.
نویسندگان
چکیده
Loops are regions of non-repetitive conformation connecting regular secondary structures. They are both the most difficult and error prone regions of a protein to solve by X-ray crystallography and the hardest regions to model using knowledge-based procedures. While the core of a protein can be straight forwardly modelled from the structurally conserved regions of homologues of known structure, loops must be modelled from a selected homologue or from a loop chosen from outside the family. Here we present a loop prediction procedure that attempts to identify the conformational class of the loop rather than to select a specific loop from a database of fragments. The structures of some 2083 loops of one to eight residues in length were extracted from a database of 225 protein and protein domain structures. For each loop, the relative disposition of its bounding secondary structures is described by the separation between the tips of their axes, the angle and dihedral angle between their axes. From the clustering of the loops according to the root mean square deviation of their spatial fit, a total of 162 loop conformational classes, including 79% of loops, were identified. One-hundred and eight of these, involving 66% of the loops, were populated by at least four non-homologous loops or four loops sharing a low sequence identity. Another 54 classes, including 13% of the loops, were populated by at least three loops of low sequence similarity from three or fewer non-homologous groups. Most of the previously described loop conformations were found among the populated classes. For each class a template was constructed containing both sequence preferences and the relative disposition of bounding secondary structures among member loops. During comparative modelling, the conformation of a loop can be predicted by identifying a loop class with which its sequence and disposition of bounding secondary structures are compatible.
منابع مشابه
PreRkTAG: Prediction of RNA Knotted Structures Using Tree Adjoining Grammars
Background: RNA molecules play many important regulatory, catalytic and structural <span style="font-variant: normal; font-style: norma...
متن کاملPrediction of Secondary Structure of Citrus Viroids Reported from Southern Iran
Abstract Viroids are smallest, single-stranded, circular, highly structured plant pathogenic RNAs that do not code for any protein. Viroids belong to two families, the Avsunviroidae and the Pospiviroidae. Members of the Pospiviroidae family adopt a rod-like secondary structure. In this study the most stable secondary structures of citrus viroid variants that reported from Fars province wer...
متن کاملBrowsing the SLoop database of structurally classified loops connecting elements of protein secondary structure
We describe a web server, which provides easy access to the SLoop database of loop conformations connecting elements of protein secondary structure. The loops are classified according to their length, the type of bounding secondary structures and the conformation of the mainchain. The current release of the database consists of over 8000 loops of up to 20 residues in length. A loop prediction m...
متن کاملFast Protein Loop Sampling and Structure Prediction Using Distance-Guided Sequential Chain-Growth Monte Carlo Method
Loops in proteins are flexible regions connecting regular secondary structures. They are often involved in protein functions through interacting with other molecules. The irregularity and flexibility of loops make their structures difficult to determine experimentally and challenging to model computationally. Conformation sampling and energy evaluation are the two key components in loop modelin...
متن کاملProtein short loop prediction in terms of a structural alphabet
Loops connect regular secondary structures. In many instances, they are known to play crucial biological roles. To bypass the limitation of secondary structure description, we previously defined a structural alphabet composed of 16 structural prototypes, called Protein Blocks (PBs). It leads to an accurate description of every region of 3D protein backbones and has been used in local structure ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing
دوره شماره
صفحات -
تاریخ انتشار 1996